Modeling the use of durational information in human spoken-word recognition.

Author

  • Odette Scharenborg
Abstract

Evidence that listeners, at least in a laboratory environment, use durational cues to help resolve temporarily ambiguous speech input has accumulated over the past decades. This paper introduces Fine-Tracker, a computational model of word recognition specifically designed for "tracking" fine-phonetic information in the acoustic speech signal and using it during word recognition. Two simulations were carried out using real speech as input to the model. The simulations showed that Fine-Tracker, as has been found for humans, benefits from durational information during word recognition and uses it to disambiguate the incoming speech signal. The availability of durational information allows the computational model to distinguish embedded words from their matrix words (first simulation) and to distinguish word-final realizations of [s] from word-initial realizations (second simulation). Fine-Tracker thus provides the first computational model of human word recognition that is able to extract durational information from the speech signal and to use it to differentiate words.

Similar resources

Using durational cues in a computational model of spoken-word recognition

Evidence that listeners use durational cues to help resolve temporarily ambiguous speech input has accumulated over the past few years. In this paper, we investigate whether durational cues are also beneficial for word recognition in a computational model of spoken-word recognition. Two sets of simulations were carried out using the acoustic signal as input. The simulations showed that the comp...

Durational information in word-initial lexical embeddings in spoken Dutch

There is a growing body of research showing the importance of durational information for the disambiguation of temporarily ambiguous speech due to lexical embedding (e.g., rye in rises) in laboratory settings. The current research investigates whether durational differences are present in nonlaboratory speech. We focus on two types of speech: read speech and speech taken from interviews. Durati...

Internet Documents: A Rich Source for Spoken Language Modeling

Spoken language speech recognition systems need a better understanding of natural spoken language phenomena than their dictation counterparts. Current language models are mostly based on written text and/or very tedious Wizard of Oz or real dialog experiments. In this paper we propose to use Internet documents as a very rich source of information for spoken language modeling. Through detailed ex...

Asymmetric processing of durational differences – Electrophysiological investigations in Bengali

Duration is used contrastively in many languages to distinguish word meaning (e.g. in Bengali, [pata] 'leaf' vs. [pat:a] 'whereabouts'). While there is a large body of research on other contrasts in speech perception (e.g. vowel contrasts and consonantal place features), little work has been done on how durational information is used in speech processing. In non-linguistic studies of low-level ...

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB), as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, IRIB's archive is one of the richest archives in Iran, containing a huge amount of multimedia data. Monitoring this massive volume of data, and browsing and retrieval of this archive, is one of the key issues for this broadcasting...

Journal:
  • The Journal of the Acoustical Society of America

Volume 127, Issue 6

Pages: -

Publication date: 2010